Markov decision process

Results: 537



#Item
491research highlights doi:[removed][removed]Technical Perspective The Ultimate Pilot Program By Stuart Russell and Lawrence Saul

research highlights doi:[removed][removed]Technical Perspective The Ultimate Pilot Program By Stuart Russell and Lawrence Saul

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2010-05-29 04:46:06
492Writing Stratagus-playing Agents in Concurrent ALisp Bhaskara Marthi, Stuart Russell, David Latham Department of Computer Science University of California Berkeley, CA 94720 {bhaskara,russell,latham}@cs.berkeley.edu

Writing Stratagus-playing Agents in Concurrent ALisp Bhaskara Marthi, Stuart Russell, David Latham Department of Computer Science University of California Berkeley, CA 94720 {bhaskara,russell,latham}@cs.berkeley.edu

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2005-05-26 13:17:00
493Selecting Computations: Theory and Applications  Nicholas Hay and Stuart Russell Computer Science Division University of California Berkeley, CA 94720

Selecting Computations: Theory and Applications Nicholas Hay and Stuart Russell Computer Science Division University of California Berkeley, CA 94720

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2012-10-04 09:08:48
494Programmable Reinforcement Learning Agents  David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]dandre,russell @cs.berkeley.edu

Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]dandre,russell @cs.berkeley.edu

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2006-01-06 14:54:10
495RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains  Emma Brunskill Computer Science Department University of California, Berkeley Berkeley, CA

RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains Emma Brunskill Computer Science Department University of California, Berkeley Berkeley, CA

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2010-06-12 13:34:38
496Concurrent Hierarchical Reinforcement Learning Bhaskara Marthi, Stuart Russell, David Latham Computer Science Division University of California Berkeley, CA 94720 {bhaskara,russell,latham}@cs.berkeley.edu

Concurrent Hierarchical Reinforcement Learning Bhaskara Marthi, Stuart Russell, David Latham Computer Science Division University of California Berkeley, CA 94720 {bhaskara,russell,latham}@cs.berkeley.edu

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2005-04-15 21:04:50
497State Abstraction for Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]

State Abstraction for Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2008-01-03 13:48:15
498Q-Decomposition for Reinforcement Learning Agents  Stuart Russell @.. Andrew L. Zimdars @..

Q-Decomposition for Reinforcement Learning Agents Stuart Russell @.. Andrew L. Zimdars @..

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2003-06-03 00:44:40
499Multi-armed Bandit Problems with Dependent Arms  Sandeep Pandey

Multi-armed Bandit Problems with Dependent Arms Sandeep Pandey

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2007-06-18 15:15:12
500Journal of Artificial Intelligence Research[removed]472  Submitted[removed]; published[removed]

Journal of Artificial Intelligence Research[removed]472 Submitted[removed]; published[removed]

Add to Reading List

Source URL: www.cs.tufts.edu

Language: English - Date: 2008-03-25 22:30:40